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1.  INTRODUCTION 


It  is  appropriate  at  the  onset  that  we  devote  a few  paragraphs 
to  the  introduction  of  the  general  linear  hypothesis  model  of  full 
rank.  We  want  to  consider  uncorrelated  observations  y} , y 2,  •••»  Yn 
that  satisfy  the  relation 


y.  = I x..0.  ♦ e. , i = 1,  2,  ....  n 

i j=l  1J  J 1 


(1.1) 


and  are  linear  in  the  unknown  parameters  0^,  0.,,  ....  0p  with  known 

coefficients  x. . and  random  term  e.  satisfying 
ij  l 


E(yi)= 


and 


Var(yi)  = a . 


In  other  words,  the  random  term  e.  is  a random  variable  with  expected 

1 2 
value  E(e^)  equal  to  zero  and  unknown  variance  Var(e^)  equal  to  a . 

The  problem,  in  its  most  general  sense,  involves  determining  point  and 
interval  estimates  of  several  quantities  of  interest  of  the  model  and 
the  testing  of  various  statistical  hypotheses. 


For  compactness  of  notation  and  ease  of  manipulation  let 


Y = 

• ' 

yl 

y2 

. x = 

X11  X12 
X21  X22  • ‘ ‘ 

IP 

XT 

2p 

, 6 = 

f»i) 

*2 

, e = 

el' 

e2 

y"j 

X , X 0 ... 

nl  n2 

k 

X 

nPJ 

, 

e 

n 

. 

then  we  can  write  the  system  of  relations  (1.1)  as 

Y = X0  + e 

and  proceed  to  define  the  general  linear  hypothesis  model  of  full  rank 
as  follows: 


PRjJCLDINB  PASS  BLANK- 

_ _ 


Definition  1.1.  The  model  Y = X8  + e where  Y is  a random  observed 
vector,  e is  a random  vector,  X is  an  n x p matrix  of  known  fixed 
quantities,  and  6 is  a vector  of  unknown  parameters  is  called  the 
general  linear  hypothesis  model  of  full  rank,  provided  the  rank  of  X 
is  equal  to  p where  p < n. 

In  the  present  inquiry  we  restrict  our  consideration  to  the  nor- 
mal theory  case,  which  means  the  random  vector  e,  already  satisfying 

2 

E(e)  = 0 and  cov(e)  = a.,  will,  in  addition,  be  assumed  to  be  normally 
distributed. 


The  problem  most  frequently  addressed  is  that  of  estimating  the 

unknown  parameters  0.  on  the  basis  of  the  observations  y. . These 

J ~ t 

estimates  of  8-,  denoted  by  B-,  are  functions  of  y^;  and,  as  such,  are 

themselves  random  variables  about  which  confidence  intervals  can  be 
constructed.  These  ideas  are  fully  developed  in  a number  of  text- 


1  2 

books.  ’ A point  not  so  widely  expounded  is  that  the  usual  frequency 
interpretation  of  a confidence  interval  based  on  a single  sample 

yV  y 2’ 

data  are  used  to  determine  confidence  intervals  for  both  8^  and  Bj , 

i t j 

constructed  will  simultaneously  contain  8^  and  8- 

is  advanced  by  the  fact  that  the  interval  estimates  are  not  independent; 
so,  in  general,  only  a single  confidence  statement  can  be  made  from  a 
single  set  of  observations. 


yn  holds  only  for  a single  coefficient  Bj ; if  the  same 

>ol 

the  probability  is  not  1-a  that  the  confidence  intervals  thus 

The  complexity 


It  is  not  our  intent  here  to  address  this  problem  directly;  such 
an  inquiry  falls  into  the  general  area  of  simultaneous  confidence 
intervals.  It  is  our  intent,  however,  to  consider  a ramification  of 
this  problem:  namely,  the  construction  of  a confidence  envelope  about 
the  entire  regression  line.  We  will,  in  addition,  provide  some  results 
concerning  confidence  envelopes  about  combinations  of  regression  lines 
and  implications  of  their  use. 

3 

Toward  this  end  consider  the  following  definition  due  to  Bose: 


1 Graybill,  F.  A.,  An  Introduction  to  Linear  Statistical  Models, 
Volume  I,  McGraw-Hill  Book  Company,  Inc.,  New  York,  1961. 

2 Rao,  C.  R.,  Linear  Statistical  Inference  and  Its  Applications, 
John  Wiley  5 Sons,  Inc.,  New  York,  1965. 

3 Bose,  R.  C.,  "The  Fundamental  Theorem  of  Linear  Estimation", 
Proceedings  of  the  31st  Indian  Science  Congress,  1944,  pp.  2-3. 
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Definition  1.2.  A parametric  function  <p  is  called  an  estimable  function 
if  it  has  an  unbiased  linear  estimate,  i.e.,  if  there  exists  an  n-vector 
a of  constant  coefficients  such  that  F.(a'y)  = \p. 

If  L is  a p-dimensional  space  of  estimable  functions  with  basis 

{<J>.  , iK«  •••>  } and  is  the  least  squares  estimate  of  if  t L,  then 

1 z P 4 

we  have  the  following  theorem  due  to  Scheff6  . 


Theorem  I . 1 . Cinder  the  general  linear  hypothesis  model  (normal  case) 
the  probability  is  1 - a that  simultaneously  for  all  ip  c L 


where  the  constant  S = (pF^Cp.n-r)} 


1/2 


and  rank  X = 


r. 


The  implications  of  this  theorem  are  far  reaching;  and  in  this 
article  we  will  exploit  a single  facet,  albeit  an  important  and  useful 
one.  To  facilitate  this  we  need  to  be  aware  of  the  fact  that  since 

least  squares  estimates  B are  BLUE,  the  elements  of  the  vector  B of 
the  general  linear  model  of  full  rank  form  a basis  of  a space  L of 
estimable  functions  which  includes  polynomials  as  a special  case. 


2.  CONFIDENCE  REGION  FOR  A POLYNOMIAL 

To  determine  a confidence  region  for  a polynomial  with  observa- 
tional equations 


yi  = 60  + 


Vi  + Vi  + 


B x?  1 + e. , 
p-1  l i 


i = 1,  2,  . . . , n 


in  the  model  Y = X6  + e,  the  n x p matrix  X 
coefficients  takes  the  form 


(x. .)  of  known  constant 
ij 


4 ScheffC,  H. , The  Analysis  of  Variance,  John  Wiley  5 Sons,  Inc., 
New  York,  1959. 
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X = 


, 2 p-1 

1 X,  x1  . . . x^ 


, 2 p-1 

1 x,  *2  • • • x2 


1 x 


n n 


- 

The  least  squares  estimate  of  B is  given  by  B = (X'X)  X'Y.  If  we 
choose  i|/i  = B^  i = 0,  1,  ...  , p-1,  then  {i|k}  = (B^  is  a set  of  p 

linearly  independent  estimable  functions  which  forms  a basis  for  the 

space  L.  For  any  value  xq  denote  \'Q  = (1,  xq)  ...  , x^  *) . Clearly, 

E(yQ)  = X^B  e L and  has  least  squares  estimate 

-1  n 

re  = X^(X'X)  X'Y  = I a.yi 

where  the  coefficients  are  the  elements  of  the  1 x n vector 

X'(X'X)"1X'.  Thus 
o 

n 2 -1  -1 
I a = X' (X'X)  X' [X' (X'X)  X'] ' 

. ,i  o o 

i=l 


X(!)(X'X)"1X'X(X'X)"1Xo 


■1, 


= X' (X'X)  X 
ov  o 

so  that  a?  = o2X'(X'X)  *XQ  with  unbiased  estimate  s“X^(X'X)  *XQ. 

* 

From  Scheffg's  theorem  we  can  assert  with  probability  1-a  that 
simultaneously  for  all  e L and,  in  particular,  X^B  e L 


where  S = [pFa(p,n-p)] 


X'B  - So.  < X'B  < X’B  + S a. 
0 0 “ 0 * 

1/2 
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As  an  illustration,  suppose  the  paired  data  (1.20,  0.34), 

(1.37,  0.94),  (1.38,  0.99),  (1.65,  1.58),  (1.71,  2.08),  (1.82,  2.25) 

2 

are  characterized  by  the  quadratic  y = -0.31x  + 3.97x  - 3.95  over 

the  interval  of  interest,  1 < x < 2.  The  95%  confidence  region  for 
the  entire  true  line  is  given  by 


X'B  - (5 . 28) < X ’ 6 < X’B  + (5. 28)a„ 
o ,—  o — o , 

ip 


as  shown  in  Figure  1. 


Grubbs  showed  that  for  the  case  y = 6 + 6,x  the  confidence 

o 1 

bounds  resulting  from  ScheffS's  theorem  are 

1/2 


lQ  + 6xx  ± [2Fa(2,n-2)]1/2S 


1 n(x-x) 
n A 

xx  j 


(2.1) 


where  S = 


1 A * 2 
I (y.  - B -6.x.) 
n-2  . , w l o ii 
i=l 


1/2 


2 2 
and  A = nix.  - (lx. ] . 
xx  1 1 1 ‘ 


Note  that  the  value  x appearing  in  (2.1)  is  not  limited  to  an  x^  which 
appears  in  the  observations  (x^,y^),  i = 1,  2,  ....  n. 


3.  THE  TWO- SAMPLE  CASE 

Suppose  two  independent  sets  of  data  have  given  rise  to  two 
characterizations  of  the  same  phenomenon  so  that  we  are  now  confronted 
with  what  is,  in  essence,  two  models: 


and 


Y1  = *1^1  + ei  » an  x Pj  problem. 


Y9  = X262  + ej  , an  n2  x p2  problem. 


We  can  still  represent  this  situation  as  Y = XB  + e where  now 


5 Grubbs,  F.  E. , 
BRL  Report  No. 


Linear  Statistical  Regression  and  Functional  Relations, 
1842,  November  1975.  (AD  #A018651) 
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' Yx  xi  I 0 ei 

Y = , X 1 > 6= 


I X2  J 


e 


The  least  squares  estimate  is  given,  as  before,  by  6 = (X'X)  X’Y. 

Consider  now  the  difference  of  two  polynomials  y*  - y*2  e L with 
LS  estimate  X*  - X*  02  where  X?  = (1,  ....  x?i  ).  Rewriting, 


X1  *1  - X2  B2  ‘ X1  I - X2 


- = X*'  | - x*'  (X ' X)  ” 1 X ' Y 


ni;n2 

Z a.y. 

i=l  11 


where  the  coefficients  ai  are  the  elements  of  the  1 x (n^n,)  vector 


X*' (X'X)_1X' . Thus, 


t.2  = X*' (X'X)_1X' [X*' (X'X)_1X'] ' = X*’ (X'X)'1X’X(X’X)'1X*  = X*'(X'X)_1X* 


Now  X*'  = X* ' I-  X*  and  (X'X)  1 


(X1x1) 


(X2X2)' 


so  X*'(X’X)'1X*  = x*cx1x1)-*x*  + x*  (X2X2)  "X*  . 


a "2  2 1 -1 

As  before,  Var(<|/)  has  the  unbiased  estimate  o„  = s X*  (X’X)  X*, 

where  s2  is  now  the  pooled  estimate  of  the  variance,  and  with  probability 
1 - a simultaneously  for  all  ip  e L 


• ^ 


with  S = 


XJ  - X*  02  - So.  < y*  - y*  < XJ  Bj  - X*  8,  ♦ So. 

ip  ip 


(P1  + P2)Fa[pl  + p2 ’ nl  + n2  " (P1  + P2}1 


To  illustrate  one  of  the  most  useful  potential  applications  of  this 
result  consider  the  situation  where  we  are  presented  with  two  sets  of  data, 

*(xiryll)’  (-x12’y12-)  ’ • • ■ ’ (xln*yln^  ^x21’y21^*  ('X22’y22^  ’ ’ ' ' ’ (-x2m’y2mJ  ! ’ 
collected  from  the  same  process;  and  we  want  to  say  something  about  the 
similarity  or  dissimilarity  of  the  two  descriptions.  Suppose  each  set  is 

fitted  with  a quadratic;  and  we  construct  the  confidence  bound  about  the 
* * 

difference  y^  - y2>  as  shown  in  Figure  2.  Over  the  region  (1.65  4x4  2.05), 

where  the  confidence  bounds  cover  the  line  y = 0,  we  will  say  the  two  des- 
criptions are  consistent,  although  the  associated  probability  level  can- 
not be  attached  without  qualification  and  interpretation. 

The  extension  of  Grubbs  result  (2.1)  to  this  case  is  direct;  the 
bounds  take  the  form 


1 61  - X2  e2  1 (P1  + P2)Fa(pl  + p2’  nl  + n2  - pl  - P25 


j ! n (X*  - X ) n2(X*  - X2) 

• — + — + + 

nl  n2  A1  A2 

XX  XX 

where  S 2 is  the  pooled  estimate  of  variance  and  A^  is  computed  from  the 
i-th  data  set. 

4.  THE  k- SAMPLE  CASE 

The  straightforward  generalization  to  k sets  of  data  proceeds  as 
follows : 


1 1 

0,0, 


► 


and 


*■  c 


(X* xx) ■ 


cixi' 


. , ckX*'  and  (X'X) -1  = 


<W 


so  CX*' (X'X)_1CX*  = I c^xr ’ (x^xi)’1xr . The  confidence  region  now  assumes 


the  form 


Ec.X?'^  - S sCX*' (X'X)_1CX*  1/2  < E^y?  < E^X?  8.  + S sCX*  (X’X)*  CX* 


with  S = Zpi*Fa(Epi,Z:ni  - Ep^ 


For  the  linear  case  we  obtain 


cx*' (x'x)-1cx*  = i c2xr ' (X!Xi)'1Xt 
i=l 


^ 2 1 , y ,2 

= .Vi  — + ni(xo  - V * 

1=1  1 


and  the  two  sample  case  (Section  3)  is  obtained  by  setting  c^  - 1 and 


c^  = -1. 


* ; 


J 
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